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Abstract 

A finite dimensional quantum mechanical system is modeled by a den- 
sity p, a trace one, positive semi-definite matrix on a suitable tensor prod- 
uct space H^ N K For the system to demonstrate experimentally certain 
non-classical behavior, p cannot be in S, a closed convex set of densities 
whose extreme points have a specificed tensor product form. Two math- 
ematical problems in the quantum computing literature arise from this 
context: (1) the determination whether a given p is in S and (2) a mea- 
sure of the "entanglement" of such a p in terms of its distance from S. In 
this paper we describe these two problems in detail for a linear algebra 
audience, discuss some recent results from the quantum computing liter- 
ature, and prove some new results. We emphasize the roles of densities p 
as both operators on the Hilbert space H' 1 ^ and also as points in a real 
Hilbert space M. We are able to compute the nearest separable densities 
ro to po in particular classes of inseparable densities and we use the Eu- 
clidean distance between the two in M to quantify the entanglement of po- 
We also show the role of to in the construction of separating hyperplanes, 
so-called entanglement witnesses in the quantum computing literature. 
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1 Introduction 



The idea of using quantum mechanical systems as computing devices arose dur- 
ing the early 1980s, and examples of the theoretical efficacy of such devices were 
soon developed. However, the subject remained primarily a topic in the theo- 
retical computer science and physics communities until 1994 when Peter Shor 
published a quantum algorithm for factoring a large composite integer N. Since 
his algorithm was polynomial rather than exponential in the number of digits 
of N, it showed that a prospective quantum computer could factor N more 
efficiently than was (or is) known to be possible on a classical computer. As a 
result quantum versions of algorithms, information theory and computational 
complexity have became subjects of widespread theoretical study, and efforts 
to actually construct physical systems which could serve as components of a 
quantum computer have become a recognized and active part of experimental 
physics. 

By its very nature, the field of quantum computation and quantum infor- 
mation theory is highly interdisciplinary and intersects with a variety of sub- 
specialities in mathematics, computer science, physics and even in philosophy. 
The purpose of this paper is to describe one particular problem in the field of 
quantum computation which should be of particular interest to the linear alge- 
bra community, and the rest of the paper is devoted to a mathematical overview 
of this topic and to the presentation of some new results. For the reader who 
would like more background in the subject of quantum computation, is 
an early survey article while [EXj, |24|, contain descriptions of the subject 
and additional references. Other introductory material can be found on various 
web sites such as that maintained by the Centre for Quantum Computation at 
Oxford University fj] . 

In the next section we give the mathematical notation necessary to describe 
the separability problem, which is related to the physical problem of construct- 
ing a system that produces non-classical phenomena. Essentially, the mathe- 
matical context is one of two nested compact convex sets and the determination 
whether a point in the larger set is in the smaller set. In section 3 we briefly 
describe the issue of quantifying non-separability or "entanglement" and settle 
on a measure which is the Euclidean distance of a point to the smaller con- 
vex set. In section 4 we present some basic topological results related to the 
separability problem, and in section 5 we develop the role of orthogonality in 
the analysis. Section 6 deals with separating hyperplanes, called entanglement 
witnesses in the quantum computing literature, and relates them to the earlier 
analysis. The last two sections deal with very specific situations in which all 
of the computations can be carried out explicitly and which were motivated by 
basic examples in the quantum mechanics literature. 

For those familiar with related work in the quantum computing commu- 
nity, we have emphasized convexity and the geometry of the underlying Hilbert 
space to provide a useful perspective of the separability problem and related 
topics such as entanglement witnesses. We have also shown how the resulting 
geometric insight facilitates the extension of results in |39] as well as the ex- 
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plicit computation of the nearest separable density to certain given inseparable 
densities. 

2 Notation and the separability problem 

Here is the context. Let denote an N = d\ x • • • x d n dimensional complex 

Hilbert space defined as the tensor product <g> • • • <g> H^ dn \ M is the real 

Hilbert space of N x N Hcrmitian matrices over if W with a real inner product 
defined by 

((A,B)) = Tr (A*B) = J2 = E (fhM ^ 

j,k j,k 

which is independent of the particular orthogonal basis of ffW used to define 
the matrix elements. D denotes the compact, convex subset of densities; that is, 
p in D is a positive semidefinite, trace one, N x N Hermitian matrix which can 
be interpreted as the state of an n-particle system where the fc'th particle has 
dk levels. The separable states (densities) comprise a compact convex subset 
S of D, and S is defined as the closed convex hull of the separable projections 
<S>fc \ipk) (ipk\- 

In this paper we will consistently use Dirac notation, so that a ket \tjjk) 
denotes a column vector in the dk dimensional Hilbert space fj( dfc ) and the bra 
(ipk\ is a (^fe-long row vector whose entries are the complex conjugates of those 
of \ipk) ■ The outer product (V'fcl is a ran k 1, dk x dk matrix, and the inner 
product of \ip) and (<p\ is denoted by the bracket (<p\ip)- In the physics literature 
the term pure state is sometimes used for both the rank one density \tp) (ip\ and 
the ket \ip) . Usually the meaning is clear from the context. (For an introduction 
to this notation in the context of quantum computing, see for example pi|.) 

It follows that the densities in D and 5 are both operators on the Hilbert 
space H( N ^ and also points in a closed convex set in a real Hilbert space M. It 
is that dual role which underlies our analysis. 

When the system of n particles is modelled by densities not in S, some 
striking quantum effects can be observed. Thus, physical experiments need to 
be designed so that the resulting density is in D — S = D n S c . The related 
separability problem is the mathematical question of how to determine if a given 
density p is in S. 

This is not an easy question to answer in this generality, and a simple exam- 
ple illustrates the difficulty. Consider a system with two 2-level particles, that 
is to say two "quantum bits" or qubits, so that N = 4 and = ® H^ 2 \ 
For this example choose p — \ipo) (i/)q\ where |V>o) = ^ ( 1 00) + |11)) and we 
are using the usual binary notation for two level systems. Thus, pa is a 4 x 4 
matrix with 1/2 in the four corners and 0's elsewhere. 

Define p (s) = (1 — s) Dq + spo, where here and throughout the paper we let 
the density Dq denote the "normalized" identity of suitable dimension, j^I. It 
is easy to show that p (0) = Dq is in S and, since po is a projection and thus an 
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extreme point in D, that p(l) = po is not in S. Thus there is an intermediate 
value So such that p (so) is in S and p(s) ^ S for so < s < 1. 

Peres ( Q ) has observed that a necessary condition for a density to be sepa- 
rable is that its partial transposes are densities, where the s'th partial transpose 
of a density p in a given basis respecting the tensor product is defined by 

P ta (ii • ■ ■ is ■ ■ ■ in, h . . . k s . . . k„) = p (ji . . . k s . . .j n , k x . . . j s . . . k n ) . (2.2) 

(Technically we probably should call this part of a generalized Peres condition, 
but that seems a bit fussy.) The necessity of the Peres condition is easy to 
confirm. If p s is a trace one, positive semidefinite matrix on H^ ds \ then so is 
its (ordinary) transpose p*. It then follows that the s'th partial transform of the 
separable density pi ® • • ■ ® p n is separable, and thus if p is a convex combination 
of separable densities, p = ^2 a p a Pa, the s'th partial transpose of p is also in S. 
In fact for the 2x2 and 2x3 tensor product cases the Peres condition is also 
sufficient (|ll|). Using that result it easy to see that for the two qubit example 
so = 1/3, and in fact it is possible to find an explicit separable representation of 
p (1/3), as noted below and in Q for example. (Related and earlier references 
include || |(]. [It], p|.) We discuss this example in more detail below, but suffice 
it to say here that the eigenvalues of p (1/3) are strictly positive so that p (1/3) 
is in the relative interior of D. 

3 Measures of entanglement 

A second theme of recent research has been to find a way to quantify the non- 
separability or entanglement of a system with density p. There has also been 
extensive work in this area, and some representative papers describing various 
approaches and basic properties which an entanglement measure should possess 
include (|, ||, [l2| ||^, Q ||(| among others. The motivation for such a 
measure is the recognition that entanglement constitutes a resource which can 
be used operationally in communications. A prime example is teleportation in 
which two different parties who share the state |-0o) = ^= ( |00) + |11)) are able 
to transfer an arbitrary quantum state a |0) + /3 |1) from one party to the other 
using classical communication and "local" operations. Figuratively this means 
that with c?i = 2 is identified with one party, typically denoted as Alice, 

while 7?( d2 ) with d,2 = 2 is identified with a second party, typically denoted 
as Bob. Alice and Bob can perform operations on their own components of 
Ij( N ) — fj( d i) (gj fj( d 2) anc j can communicate classically which operations they 
performed and whatever information they obtained from their operations. 

An example of teleportation is the following. An arbitrary quantum state 
|^>) = q |0) + /3|1) in a third Hilbert space is available to Alice. Without 
actually knowing \ip) she performs operations on that state and on H^ 1 ^ and, 
using classical communication, transmits the results of a measurement to Bob 
who can then recreate \ip) in H^ d2 \ again without knowing (For references 
and discussions see for example p|, |3|, ^l[.) 
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Two of the measures of entanglement for bipartite states which have been 
motivated in part by teleportation are the measure of formation and the measure 
of distillation (see for example [|| [|, ) . The respective contexts concern the 
creation of a state from pure states and "distilling" the maximum number of 
entangled states of the form \ip ) — ( 1 00) + |11)) from copies of a given 
density p. In a sense these particular measures can be considered as operational 
measures, since they deal with the creation of a mixed state or the extraction 
of maximally entangled pairs. 

The definition of a measure of entanglement as an infimum of "distance" 
from S was introduced in Q with an expanded discussion in Q. Other au- 
thors, particularly J33|, |3(j], took an axiomatic approach and also discussed basic 
properties that such a measure should possess, motivated in part by interpreting 
the operations in teleportation and distillation as mappings of densities in M. 
(For an alternate approach to entanglement using "robustness of entanglement" 
see J§.) 

As a paradigm based on |34| , we give the motivation (and terminolgy) for 
requiring that a measure of entanglement E satisfy the following three proper- 
ties: 

(a) E[p] = if and only if p e S, 
(b) E [UpU^] = E [p] where U is a local unitary mapping, 
(c) E (p)] < E [p] where $ is a local, completely positive, 
trace preserving operator on D. 

The motivation for the first property is obvious if one is measuring non-separability. 
A local unitary mapping U is a tensor product of unitary maps on the constituent 
product spaces and models the unitary transformations, such as a change of lo- 
cal basis, that could be taken independently on the individual spaces. Property 
(b) requires that entanglement should remain the same under such mappings. A 
completely positive trace preserving operator $ on D models the measurement 
process and can be represented |3(J as 

* (p) = E Vk p y k where E V H v * = L 

k k 

It is easy to confirm that $ maps D into D. Locality is imposed by either 
assuming Vk is a tensor product of operators on the constituent spaces or else 
by simply assuming that <E> also maps S into S. The point of axiom (c) is that 
one should not be able to increase entanglement under local operations. 

We mentioned above that the normalized identity Dq is a separable state, and 
it is obvious that Do can be written as an (equally weighted) convex combination 
of any N orthogonal projections, each of which could be entangled. For example, 
in the two qubit case Dq can be written as the average of the densities defined 
by the four orthogonal "Bell states" ^ (|00) ± |11)) and J= (|01) ± |10». Thus 
one could expect that a measure of entanglement would recognize the decrease 
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of entanglement under convex combinations and satisfy 

(d) E^PaPa] <J>a£[Pa]. 

This property is not a standard requirement, although it is just the triangle 
inequality for distances, and it can be shown [B5| that some of the proposed 
measures automatically satisfy (d). 

Since our goal here is to gain insight into the geometry of the separable densi- 
ties, we will not give a complete account of the various measures of entanglement 
which have been proposed but instead will use the measure of non-separability 
which comes naturally from the Hilbert space structure of M. Invoking the idea 
of minimal distance from S |3q] , we use the Frobenius or Hilbert- Schmidt norm 
and define a measure of entanglement as the minimal distance of a density p 
from the set of separable states: 



m{p}=mf\\p-T\\ = mf J Tr ((p- rf). (3.1) 

This measure has been already been considered as a possible measure of en- 
tanglement by other authors such as [|l9| [55], E39|, but since it does not seem 
to relate to operational uses of entanglement between parties, it has not been 
widely used. However, it is easy to show that m satisfies most of the properties 
discussed above. For example m [p] = if and only if p is in S, and it is easy to 
check that m [U pC/ -1 ] = m [p] for all local unitary operations, m also satisfies 
(d). Let p = ~Yl, a Papa and suppose m [p a ] = \\p a — r a \\. Then by the definition 
and the triangle inequality 

m[p\ < |E Po ( pa ~ r °) - ^2 Pa W Pa ~~ Tq II = E Pam • 

Although it does not seem to be known whether m satisfies condition (c) 
as stated, see |2^, [39| for discussions of this point, m does satisfy a special 
case of (c) when $ models a von Neumann measurement. Specifically, we also 
assume that {Vk} is a complete set of orthogonal projections which map S to S 
and let p = <I> (p) . If t is the nearest separable density to p, then 

||$ (p)-<S>(T )\\ 2 = Tr[Y / T, V ^P- T o) V J V ^P- T o)V^ 

\ 3 k 

= Tr (Jy2 V Ap-To)V^j -To) 

< ||$ (p)-$ (t )\\-\\p -T ||. 

Thus m[p] < ||$ (p) — $ (to) II < ||p — To|| = m [p] as advertised. Since the 
inner product structure of M also gives geometric insights to aspects of the 
separability problem, as shown for example in Witte and Truck's paper p9| , we 
shall use m as the measure of choice in this paper. 
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As a final remark on the issue of measures of entanglement, we note that 
Vedral and Plenio |35| suggested that condition (c) be replaced by 



(c') Y,PkE[ Pk ]<E[ P ] 

k 

where the p k s are particular densities derived from p via 
Pk = —VkpVl with Pk = Tr (Vkpvf) • 

Pk v > 

They give a reasonable motivation for this stronger condition, but it should be 
noted that if p = pkPk to begin with and if $ is a von Neumann measurement 
leaving each p k unchanged, then if E also satisfies the convexity property (d), 
E has to be linear in this case: 

E[p}=J2PkE[pk]- 
k 

A measure based on relative entropy does satisfy this condition J35|, but it is 
too strong a condition for the Frobenius norm. 



4 Basic Theory 

In each of the n Hilbert spaces defining ffW we can define an orthogonal 

basis which arises from the physical properties of the dfe-level system we are 
modelling. In the quantum computation literature this is called the computa- 
tional basis, and tensor products of these basis vectors define a basis for H^ N \ 
If we define projection operators on each of the H^ dk \ then their tensor prod- 
ucts are the separable projections (g)fc \ijjk) (^fc| whose convex hull is S. More 
generally, if we take a basis for the d\ dimensional space of linear operators on 
H^ dk \ then their tensor products define a basis for the N 2 dimensional space 
M. 

Now it was shown in |2^| that one can take what amounts to a discrete 
Fourier transform of a suitable arrangement of such product basis matrices and 
obtain a particular orthogonal unitary basis \S{j,k) \ f° r M which is indexed 
by pairs (j, k) of n-long vectors j, k. Using coordinate-wise addition, the set of 
indices defines an Abelian group G of order N, and {S^fc)} turns out to be a 
projective representation of G x G. The Su are unitary matrices and need 
not be in M; rather they serve as a basis of N x N matrices over the complex 
numbers. The reader is referred to |2(| for details of the construction, and we 
limit ourselves here to recording the results we need. (See We also 

note that Werner J38| shows the close connections among orthogonal unitary 
bases, dense coding and teleportation, all topics of great interest in quantum 
computing.) 

Using e to denote (0,0) and a as a generic index (j,k), the unitary ma- 
trices in {S a } have the following properties: (1) S e = I, the N x N iden- 
tity, (2) Tr (S^Sh) = N5(a,b), and (3) S a has the spectral representation 
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Ylk ^a,kPa,k> where the P a ,k are separable orthogonal projections. Since S a 
is unitary, ^2 k P a ,k — I and |A 0) k| = 1. Since {S a } is a basis, a density p in D 
can be expressed as ^ a s a S ai and the last particular property is that \s a \ < 1 
and s e = 1. 

It has been shown in a number of papers, initially in and also in ^ [l9| 
for example, that there is an open neighborhood of the normalized identity Do 
which is composed entirely of separable densities. Using the properties of {S a } 
we give a short proof. 

Proposition 4.1 If p is a density with X) Q ^e l s °l — 1> then p is separable. In 
particular, there exists an open neighborhood of Dq composed of separable states. 



proof: Since p is Hermitian, p 
listed above we have 



p + pt). Using the various properties 



1 

N 
1 

N 



Sr 



1 



2^ 



a^e k 



\ a^e J a^e k 



+ COS (9 a ,k))P a ,k, 



where \ {s a \ a ,k + s a X a ,k) = \s a \ cos (0 a ,k)- Since l + cos(6» aifc ) > 0, we have 
written p explicitly as a convex combination of separable densities, and thus p 
is in S. For the last assertion the condition l s al < 1 defines a relatively 

open set in D, □ 

In order to determine if an individual density is separable using this criterion, 
one has to compute each of the coefficients s a . Two weaker but user friendly 
corollaries are immediate consequences, however. 



Corollary 4.2 If e < (N 2 

open set of densities in S . 



l) \ then {(1 



e) Dq + ecr, a G D} is a relatively 



Proof: Let fi denote a density (1 — e) Dq + ecr, so that the a/e coefficient 
of p is es a where s a is the corresponding coefficient of a. Since e < (iV 2 — l) , 

J2 a =j£e € \ Sa \ < 1 aim M i s separable. □ 

Generally speaking it does not appear that the eigenvalues and eigenvectors 
of a density are useful in distinguishing a separable from a non-separable state. 
One counterexample is the following result which is not particularly strong but 
which has an easy proof. 



Corollary 4.3 If the smallest eigenvalue of a density p is at least 



t = 



N 



N+t 



whe 



N 2 



■, then p is separable. 
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Proof: We can use the spectral representation of p, obtaining 

k 



l—Y^aM (^ k \ + JL-D , 



N + t 



where < a k 
unitary basis 



X k {N+t)-l 
t 



Thus p = J2k a k IV'fc) ("0fc| is a density. In the 



N 



N + t 



where the s a are the {S a } coefficients of p. Then 

Sew< * 



AT 



(A 2 - 1) < 1 



showing that p is separable. □ 

As an example of Proposition 4.1, one can use the definition of the S^'s as in 
[p5| to show that the s a coefficients of the two qubit density p (s) defined above 
satisfy X) Q ^e I s "! = 3s. Thus s < ^ is also a sufficient condition for separability, 
and one does not need the Horodecki-Peres result. Our main application of 
the preceding proposition, however, is to characterize densities in the relative 
interiors of S and D. 



Proposition 4.4 A density p in S is in the relative interior of S if and only if 
there exists a t > such that (1 + 1) p — tDo is in S. The same assertion holds 
if S is replaced by D throughout. 

Proof: Suppose that p — (1 + 1) p — tDo is in S. Then for any a in D 
^+ I ^ t (Do + e(a-D ))=p+^ rt (o--D ) 

is also in S provided e < (iV 2 — l) . Conversely, if p is in the relative interior 
of S, then for small S, p + 5 (a — Dq) is in S for all a in D so that choosing 
a = p gives a separable density p = (1 + 5) p — SDq. The same proof works if S 
is replaced by D. □ 

The use of a line segment connecting a density with the normalized identity 
Dq turns out to be a helpful tool in the analysis. Accordingly we shall refer 
to (1 + 1) p — tDo as an entanglement probe and note that Vidal and Tarrach 
[p3| made extensive use of entanglement probes in defining and investigating a 
"robustness" of entanglement for densities. Two easy applications show both 
the utility of entanglement probes and the contrast between S and D. 
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Corollary 4.5 A density p is on the boundary of D if and only if p has a zero 
eigenvalue. If p is in S and rank(p) < N , then p is on the boundary of S and 
also of D . 

Proof: If p \tp) = 0\ip), then (1 + 1) \p\ t/>) - t (t/> \D Q \ ip) < and /i = 
(1 + 1) p — tDg is not positive semidefinite. Conversely, if the eigenvalues of p 
are bounded below by s > 0, then p can be written as 

p = (1 - sN) \^k) {M + sNDo 

k 

and since sN < 1, p is in the interior of D. If p is in S 1 and rank(p) < iV, the 
same proof shows that p is on the boundary of D and thus also of S. □ 

There is a class of densities which satisfy the Peres partial transform con- 
dition but which are not separable, and there have been a number of detailed 
investigations of these densities using rather different techniques than those de- 
scribed above. For an introduction and additional references see Lewenstein et 
al in || . 

5 Orthogonality 

In our running example we have seen that p (|) on <g> is the closest 
separable density to the inseparable density po along the line connecting Dq 
and po- As it happens, p (^) is also closest to po in the norm || || defined in 
equation (3.1). To see this we need an alternate characterization of the density 
in S closest to a density p in D — S . This characterization is a standard result 
in convexity theory and has been used in jl9[ [39| for example. 

Proposition 5.1 Suppose p in D — S . Then r is the unique closest separable 
density to p if and only if for all r in S 

({p - r ,r - r )) = Tr ((p - r ) (r - r )) < 0. (5.2) 

By the convexity of S , it suffices to prove the inequality for all separable projec- 
tions T . 

Proof: Adding and subtracting ro gives 

({p-T,p-T)) = ((p- r ,p-T )) -2((p-T 0> T-T )) + ((t -t,t -t)) 

which shows (5.1) is sufficient. Conversely, if ((p — a, p — a)) is minimal over S 
when a — to, then ((p — tq, a — ro)) < \ ((to — <r, To — a)). Using the convexity 
of the separable states, let a = (1 — t) r + tr with < t < 1, where t is in S. 
It follows that 

((p -t ,t - t )) < i (1 -t) ((t - t,t - t)) , 
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and letting t go to one gives the result. If t\ also minimizes ((p — r, p — r)) , 
then from ((p — tq, t\ — tq)) < and {{p — t%,tq — t%)) < 0, we can conclude 
that ((n — to,ti ~ r o)) < 0, confirming uniqueness and completing the proof. 
□ 

An extremely useful geometric entity is the separable face nearest a given po 
in D — S. Let r denote the nearest separable density to po and use the notation 
of the proposition above. 

Definition 5.3 F (po, tq) denotes {r E S : (po — To, r — To) = 0} . 

Thus -F (po, To) is the convex set of densities in S such that as vectors r — To 
is perpendicular to po — To. We leave it to the reader to confirm that F (po, tq) 
is indeed a face of 5 and that the extreme separable projections in a convex 
representation of To necessarily lie in F (po, To). 

The alternate characterization of To allows us to compute the nearest sep- 
arable density in some cases, and we pursue that idea next. As an example, 
the following result includes Proposition 1 of ||{| as a special case in which the 
density pi below is separable and equal to a density in F = F (po, To). 

Corollary 5.4 Suppose t and T\, the nearest separable densities to po and 
Pi respectively, are both in F — F(po,To). Then the nearest separable density 
to tpo + (1 — t) pi is t (t) = tTo + (1 — t) Ti, and thus m [tpo + (1 — t) pi] < 
tm [p ] + (1 - t) m [pi]. 

Proof: Since ((p - t , t (t) - t )) = ((p - n,T (t) - Tj,}) = 0, we have 



completing the proof. □ 

As another application, we are able to give a geometric perspective to To (d), 
the separable density closest to the bipartite state po (d) = \ipd) (V'dl where 



This includes the motivating example as a special case. The state po (d) is 
known as a maximally entangled state and for d — 2 was used by Werner [B7| 
in an analysis of "local reality" and the Einstein, Podolsky, Rosen paradox 
[^|. Now the convex combination (1 — s) Dq + spo (d) can be interpreted as a 
mixture of the maximally entangled state po (d) and Dq, the "maximally mixed" 
state or random noise. In earlier studies, including |27| and |l7) and references 
therein, the largest value of s for which (1 — s) Dq + spo (d) is separable was 
investigated, which is equivalent to the question of how much noise it takes to 
make the system unentangled. As it happens, the nearest separable density to 



((tp + (l-t) Pl -T(t),T-T(t))) 

t ((po -To,T-T (t))) + {l-t) (( Pl T\,T T (t)» 
* <(p - T , T - T )) + (1 - t) «pi - Tl, T - Ti )) < 0, 
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po (d) is such a convex combination. That result for d = 2 seems to have been 
noticed first in and the proof below for arbitrary d follows their approach. 
(An independent proof of the general case recently appeared as part of the 
analysis in p9|.) 

Proposition 5.5 The state tq (d) — (1 — s d ) -Do + $dPo (d) with s d = (1 + d) 1 
is the nearest separable density to the maximally entangled state po {d) . The 
analogous assertion is false if the number of product states n is bigger than 2. 

Proof: The proof that tq (<f) is separable has been given in a number of 
references such as || [l?], |sj among others. Dropping explicit mention of d, 
we thus need to prove that for all separable projections r 

((po - t ,t- T )) = ((po -D ,t- Tq)) = ((pO,T- T )) < 0. 



(1 - 8 d ) 



d 1 



First, Tr (\^ d ) (^| T ) = (1 - s d ) Tr (i \^ d ) (M) +s A Tr (\i> d ) - TOT 1 
= 2- If T is a separable projection, then t ~ \a) (a\ (8 |/3) (f3\ where in the 

computational basis \a) = X)fc=o a k l&) with ^2 k \ak\ 2 — 1 and with an analogous 
expression for |/3). Then 



Tr(\rp d ) {i> d \r) = X - 





2 


1 

d 




2 








< 













1 

d 



since E/c a fc^fcl — zCfc l a fcl zC/t by the Cauchy-Schwarz inequality. Hence 
((po — To, t — To)) < for all separable densities and To (d) is the closest sep- 
arable density to po (d) when n — 2. When n > 2 and \ip d (n)) is defined 
analogously, Tr (\ip d (n)) (ip d (n) \ t ) equals gnr^l=T) an d it is easy to see that 

there are separable projections with Tr (\ip d (n)) (ip d (n)| t) = ^, completing the 
proof of the proposition. □ 



Corollary 5.6 Using m as the measure of entanglement, m [po (d)] = Jl — g^j, 
so that entanglement increases with increasing d. □ 

As another application, we can compute explicitly the extreme points of 

F( P0 (d),T (d)). 

Corollary 5.7 F (po (d) , To (d)) is the convex hull of \a) (a\ (g) |a) (a\, where the 
bar denotes the complex conjugate of the entries of the row or column vector. 

Proof: From an earlier observation, it suffices to consider densities of the 
form t = \a) (a\ <g> \(3) ((3\. From the proof above, t is in F (po (d) ,To (d)) if 

^ 2 ^ 2 ^ l — 1 2 

and only if \ J2k a kbk\ = Sfe l a fel Sfe \bk\ ■ This is the case of equality in the 
Cauchy-Schwarz inequality over the complex numbers and is equivalent to bk — 
ca,k for some constant c and all k. (See for example p9[.) By the normalization 
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condition, |c| = 1 and thus does not appear as a factor in r = \a) (a\ ® \a) (a\, 
completing the proof. □ 

As an example, when d = n = 2, the basis of orthogonal unitary matrices 

{5 a } defined earlier is essentially the set of four Pauli matrices: Co = ^ J ^ 
1 \ /0 1\ , / -i 



.ji."! - i j i > and = ^ . Q ). (The only dif- 
ference is that one uses ^ ^ q j m ueu °f °V) ^ ^ s eas y to show that 

to = /o(^) is the average of the six separable projections | (ctq ± er z )<g) (er ± cr z ), 
j (ao ± cr x ) <S> (ctq ± er x ), and j (ctq ± Cj/) <8> (ctq T Cy) an d that these projections 



have the requisite form \a) (a\ <8> \ce) {a\. 



6 Entanglement witnesses 

Suppose po ^ S. Then a standard consequence of the Hahn-Banach theorem 
for convex spaces is that there exists a linear functional F on M such that 
F (po) < < inf [F (t) , t € S). In the context of our finite dimensional Hilbert 
space M, the Riesz representation theorem says that each linear functional is of 
the form F (p) = Tr (Ap) for some Hermitian matrix A. (See for example Q.) 
It is also a standard fact that the hyperplane M (A) = {B G M :Tr (AB) = 0} 
has dimension dim (M) — 1, so that one can view M (Aq) as a separating hy- 
perplane with po on one side and S on the other. Since quantum mechanical 
observables are modelled as Hermitian matrices, the thrust of the theory is that 
the condition p £ S can be "witnessed" by a suitable observable A, and such 
Hermitian matrices have been dubbed "entanglement witnesses" in the quan- 
tum computation literature. This connection was first pointed out in and 
the authors went on to link these ideas to the Banach algebra literature. In 
particular they showed that the Peres necessary condition for separability is 
also sufficient in the 2x2 and 2x3 tensor product cases. 

The Peres condition can be couched in the language of positive operators 
on bounded functions on H^ N \ and, as mentioned in the introduction, one 
direction of research on separability has focused on densities which satisfy the 
Peres condition but which are not separable. A consequence of that work has 
been a study of entanglement witnesses in general. Recent relevant papers which 
contain further references include |l4|, |l^, |3lj, [52| . 

Since separating hyperplanes are not unique, it is customary to normalize in 
the entanglement context by requiring that Tr (ADq) = 1 in addition to 

Tr (Ap ) < < inf [Tr ( At) ,t e S] (6.1) 

for some inseparable density po ■ In [[l6[ the authors introduced a partial order on 
such entanglement witnesses as follows. Let D (A) denote {p £ D : Tr (Ap) < 0}. 
Define a partial order by A ^ B if and only if D (A) C D (B). Then an optimal 
entanglement witness is a maximal element in the partial order. We should note 
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that the analysis in |l6|] deals with general entanglement witnesses, and one of 
the sufficient conditions below for Aq to be optimal appears there. 

The connection with our analysis is that knowing the closest separable den- 
sity To to a nonseparable density p$ also enables one to construct an entangle- 
ment witness Aq for a class of densities related to pq. This is not an assertion 
that actually finding tq is computationally easy. Rather, it illustrates the im- 
portance of To and shows Tr (Aqo~) has a familiar form which further reveals its 
geometric character. 

We assume equation (6.1), but since we begin with a particular pq we use a 
slightly different normalization. 

Definition 6.2 Aq is said to be optimal provided that any Hermitian A satis- 
fying equation (6.1) together with Tr(Apo) = Tt(AqPq) and D (Aq) C D (A) 
necessarily equals Aq. 

Theorem 6.3 Suppose r is the nearest separable density to a non-separable 
density p . Then the Hermitian matrix Aq = CqI+Tq—Pq with Cq = Tr (tq (pq — Tq)) 
is an entanglement witness for p . In particular for any density a 

Tr(A a) = -(( Po -T ,a-T )) (6.4) 

so that the separating hyperplane defined by Aq contains F(pq,tq). If some r 
in F( P q,tq) has full rank, then Aq is optimal. 

Proof: It is easy to check equation (6.2) so that Aq has the asserted prop- 
erties. Next, suppose that A satisfies equation (6.1) and that D (Aq) C D (A) 
with Tr(Apo) — Tr(AQpo). Using one of the techniques in Q, suppose that 
Tr(A a p) = 0. Then for < s < 1, Tr (A a ((1 - s) p + sp )) < implying 
Tr (Ap) < j^Tr (Ap ) and thus Tr (Ap) < 0. In particular Tr (At) < for r 
in F (pq,t ), forcing Tr (At) — 0. 

Now suppose that there is a r in F (p , tq) with full rank, so that its smallest 
eigenvalue is strictly positive. Then it's straightforward to show that there exists 
a small positive t such that for any density p, p(t) = (1 + 1) r — tp, a variant of 
the entanglement probes defined earlier, is in D. In particular if Tr (Aqp) = 
we have 

Tr (Aqp (t)) = 0>Tr (Ap (t)) = -tTr (Ap) > 

forcing Tr (Ap) — 0. This gives the property that Tr (Aqp) ~ implies 
Tr (Ap) = which suffices for the rest of the proof. In fact, that property 
together with equation (6.1) and Tr (Ap ) = Tr (A p ) is equivalent to A = Aq. 

Suppose Tr(A p) > 0. Then Tr (A ((1 - s) p + sp )) = for some s in 
(0, 1), and from the normalization it follows that Tr (A p) = Tr (Ap). In par- 
ticular Tr (AqDq) = Tr(ADo). Finally, if Tr (A p) < 0, then analogously 
Tr (A ((1 - s)p+ sD )) = for some s in (0, 1), and that gives Tr (A p) = 
Tr (Ap). Consequently Tr (Aqp) = Tr (Ap) for all p in D, and it follows that 
Aq = A, completing the proof. □ 
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Corollary 6.5 Aq is optimal if the separable eigenvectors of the rank one sep- 
arable projections in F(pq,tq) span H^ N ' or if there is a density p of full rank 
such that p = (1 + t)r — tpo for some r in F (/do, To). 

Proof: In the first case, it is easy to see that one can construct a r in 
F(pq,tq) which has full rank. In the second case, the techniques in Corollary 
(4.3) show that r has full rank. □ 

As an example of this general theory we have the following specific result 
which includes example 5, up to a multiplicative constant, in [p2| . 

Corollary 6.6 For the usual 2x2 bivariate example, r = p (1/3) has full rank 
and 



A = -(I- 2p Q ) 



/ -1 \ 

10 

10 

\ -1 / 



is an optimal entanglement witness. D (Aq) contains p a — \ip a ) (V'al f or an V 
density of the form p a = \ip a ) (ip a \ where \ip a ) = X)fc=o flfe 1^) w ^ non-negative 
ak such that ^ fc |afc| 2 = 1. In the corresponding dx d case, Aq = (I ~ dpo)- 
□ 



If the matrix above is denoted as Max , the Aq in the d x d case turns out 



to be a multiple of 



Mjk, where the Mjk have definitions analogous 



0<j<fc<d lvl jk, 

to Mqi. (See equation (7.4).) Another role for the Mjk is given below, where 
we find other nearest separable states using an extension of the methodology 
developed above. 



7 Variations in the bivariate case. 

It would be useful to be able to calculate m [p] , the Frobenius measure of en- 
tanglement of states other than the maximally entangled states, and we can do 
this for states which are near to the maximally entangled state in a sense to be 
made more precise below. We will use the geometric insights obtained above in 
the context of two d-level systems and motivate the analysis with the usual two 
qubit case d = 2. That particular case was also studied by Witte and Trucks 
|$9| who used a different approach to obtain Proposition 7.1 below. 

Our approach is motivated by the geometry. We know that pq — tq is 
orthogonal to the face F(pq,tq), and from Corollary 5.3 we also know what 
the extreme points of F(pq,tq) are. Now suppose that p a — \ip a ) (ijj a \ where 
ip a = J2k=o ak l^k) with < dk, J2k a k = 1j an d the et^'s are close to l/\fd. 
Then one would expect that r a would also lie in F (po, tq) and thus, considered 
as vectors, that p a — r a might be parallel to po — tq. This could take the form 

t q = p a + t (r - pa) (7.1) 
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where t — t(d, a) is a positive constant to be determined. This particular 
representation works when d = 2, and we obtain the same constraints on the 
parameters ao and a\ found earlier in |3jj]. As a convention, we will assume that 
cto > ai throughout. 



Proof: If r Q were in F(po,tq 



Proposition 7.2 In the case d = 2, r a = p a + t (to — po) fes m -F (po, To) cmd 
is t/ie closest separable density to p a provided t = 2aoa\ and | a.§ — | < The 
Frobenius measure of entanglement is then m [p a ] = 2aoai/V3 = 2aoa\m [po]. 

then ((po — TQ,T a — tq)) = and for any 



((Pa ~T a ,T- T a )) = t [({p - T0,T - T )) - ((p - Tq , T a - T ))] < 0, 

confirming that r a would be the closest separable density to p a . For 



/ a 2 - t/6 



Y aoai — t/3 





t/6 

t/6 





aoai — t/3 \ 



a\ - t/6 j 



to be in F (po, To) it has to be a convex combination of the extreme points of 
F(po,Ta). If the entries of are denoted by rke l8 ^ k \ then the (jij'2, /cifc2)'th 
component of t = |/3) ® \(3) (/3| is 

o i(e(j 1 )-e(j 2 )-fl(fc 1 )+e(J:2)) 



(7.3) 



It follows that Too, 11 = Tii^oo = T io,io = r oi,oi — r o r i m always real, and it is 
easy to see that all other entries have phase angles. Thus a necessary condition 
is aoai — t/3 = t/6, giving t — 2aoai as asserted. 

Keeping ro and r\ fixed and averaging extreme points with phase angles 
changed appropriately by angles of 7r/2 and 7r, one can eliminate non-zero entries 
where phase angles appear and thus define a convex subset F (po, t ) of F (p , tq) 
defined by densities of the form 



V 



r 4 
'0 








r\r\ 





r\r\ 














r^r( 





r 2 r\ 








r\ 



\ 



} 



A necessary and sufficient condition for r a to be in this convex subset of F (po, to) 

should be in the convex hull of vectors 



is that (ap 



aoai 
3 



aoai 
3 



2aoai 



(r^, rf, 2r\r\ ) with r§ + r\ — 1. If xq and x\ denote r\ and r\ respectively, 
then it is easy to check that an equivalent condition is that (ag,ao — 



should be in the convex hull of vectors [xq, XqJ where < xq < 1. But that set 
is precisely the set of pairs (x, y) with < x < 1 and x 2 < y < x. That means 
a-o < o-o ~ or 1 — 3 a o a i, which is equivalent to the condition asserted in the 
statement of the proposition. (Since d = 2, the same result could be obtained by 
using the required positive definiteness of r a and the Peres-Horodecki theorem.) 
The calculation of m [p a ] is immediate, completing the proof. □ 
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Corollary 7.4 If = §+ g ; thenr^ = I 7+ 1 3 8 ) andr a is on the boundary 
ofD. □ 

The significance of Corollary 7.1 turns out to be that for Oq > 5 + -^5 the 
vector p a — r a is in fact not parallel to po — to, and the techniques above do 
not give r a . This same problem arose in [ [39| where it was conjectured that r a 
could be computed using the root of a cubic polynomial. Geometrically, that 
cubic is based on the assumption that the nearest separable density to p a is on 
the boundary of F (po, To), which is reasonable since we have already seen that 
p a is separated from S by the hyperplane containing F (pa, To). The gap in the 
argument is that one needs to show that r a has to be in F (po, To). 

The same approach works for d > 3 but with the need for additional pa- 
rameters. The entries of the extreme points \0) (f3\ ® (J}\ have the same 
sort of pattern as in the d = 2 case with real entries r 2 r 2 in positions (jj, kk), 
(jk,jk), (kj, kj), and (kk.jj) for < j, k < d — l.This means there are (2) sets 
of four entries for which the components of any density in the convex hull of the 
extreme points must be constant, and the single parameter t in equation (5.1) 
doesn't suffice. 

To obtain more parameters, we look for additional Hermitian matrices or- 
thogonal to F (p , To). In particular, define the (^matrices Mjk, < j < k < d, 
whose entries are +1 at (jk,jk) and (kj,kj), —1 at entries (jj,kk) and (kk,jj) 
and are elsewhere. Then it is easy to check that for extreme points t in 

Tr (Mj k r) = Tjk,jk + T kj,k 3 — T 3 j.kk ~ Tkkjj = 0, (7.5) 

and thus each Hermitian Mjk is orthogonal to densities in F (po, To). Use / to 
denote the d 2 x d 2 identity and note that 

Y J M 3k =I-d Po . (7.6) 

j<k 

As noted in the preceding section, up to a multiplicative constant / — dpo is the 
optimal entanglement witness Aq based on po. 

We break the analysis into two parts, first showing that the r a defined below 
is the closest separable density to p a , assuming r a is in F(po,To), and then 
obtaining sufficient conditions on the a/c's for T a to be in F (po, tq). Recall that 
p a has entries a^ak in positions (jj,kk) and (kk,jj) for < j, k < d — 1, so 
that J2k a k = 1- As a convention we assume that ao > a\ > . . . > ad-\. Set 
a* a — ^2j <k o-jak and impose the first constraint on the (Zfc's: 

2 2a* a 

°^ - WTTy (77) 

Proposition 7.8 Let {Mjk} be defined as above and let M a denote Ylj<k u jkMjk- 
Define 

Ta= Pa+t(T - p a )+M a , (7.9) 
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where t = and Uj k — \ (aja k — 2) ■ Then X^ <fe uj k = 0, and r a is a trace 
one, Hermitian matrix with non-negative entries on the diagonal. Moreover, 
Pa — T a is orthogonal to F (p ,T ). If in addition r a is in F (po> T o); then it is 
the closest separable density to p a and 



m[p a ] 



Tr 



(Pa - T a y 



\ 



t 2 1- 



1 + d 



j<k 



(7.10) 



Proof: Using the definitions we first compute the non-zero entries of r : 



T a (a, a) 

T a (jk,jk) 

Ta (jj, kk) 



td 



2a * a 



1 1 

l + d\~dP~d) " l d(d+l) 
td 1 If 2a* a 

iTdd? +Ujk = 2{ ajak -d{dTT) 



J ^ k. 
2a * a 



td 1 

fl ' afc ~ TTd d Ujk = 2 [ a > ak d{dTT] 



3 + k. 



By virtue of equation (7.5) r has non-negative entries on the diagonal and 
the desired pattern of values on the remaining entries. Note that these same 
formulas work for d = 2. One can show Tr (r a ) = 1 directly or by confirming 
that 



E u i k = \ \ a * a 

j<k ^ 



d\ t 
21 d 







so that Tr (M„) = 0. Since Tr (M jk (po - td)) = j^Tr (M jk (p - D )) = =£, 

Tr (M a (p - T )) = ^ u J k = °- 

j<k 

Thus, as a vector p a — r a can be viewed as the sum of two orthogonal vectors 
in (F (p , tq))" 1 , the linear subspace of Hermitian matrices perpendicular to 
F {po,tq), and (7.7) follows from that observation. 

So far we have only shown that r a could be in F(p ,T ), with equation 
(7.5) the only constraint imposed so far on the a k s. To complete the proof 
of the proposition, we show that if r a is in F(po,To) then it is the nearest 
separable density to p a . Since ((po — to, r a — tq)) = ((Mjk,T a )) = under that 
hypothesis, 

((Pa-T a ,T-T a )) = t[((p -T ,T-T a ))}-^2u jk ((M jk ,T-T a )) 

3<k 

= t [((Po - to, t - r ))] - ^2 u jk ((M jk ,T)) 

j<k 

= jT-r ((Po - A),t-t )) -J2 u Jk (( M jk,T)) . 

j<k 
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Now it is easy to check that 

((po - D ,t-t )) = ((po, t)) - ((po,T )) = ((A), t)) - ^ = ^ ((dpo - I,t)) , 

where / denotes the d 2 x d 2 identity matrix. Using dpo — I = — J2j<k Mj k from 
equation (7.4) and 

Ujk ( (M jk , r) ) = X - ajdk ( (M jk , r)) - ^ ^ ( (M jfc , r) ) , 

we have 

•t(d-i) 



«Pa-r„,T-r„)) = -^X!^ M J' fc ' T )) \ ,/(,/- I) 

j<fc 



We have already shown that {{Mj k ,r)) — when r is in F(po,To). If t is a 
separable extreme point of the form \j3) (f3\ ® I7) (7I, then using the obvious 
notation and equation (7.3) 

((M jk ,r)) = r jkdk + Tkj,kj Tjj.kk T kk,jj 

= b 2 c 2 k + b k Cj - 2b 3 bkCjC k cos (9 jk ) > 0. 

Hence ((p a — r , t — T a )) < for all separable densities, and that completes the 
proof. □ 

As we saw in the case when d = 2, further restrictions on the a^'s are required 
to show that r a actually is a separable density. We examine that problem next 
and obtain sufficient conditions on the a k s for r a to be in F(p n ,To). It then 
follows from the foregoing analysis that r a is the separable density closest to 
the related density p a , confirming the intuition that motivated this analysis in 
the first place. Unfortunately the algebra appears to be too involved to get as 
precise a result as in the case when d — 2. 

What we do instead is demonstrate a methodology which shows that there 
exists a neighborhood of the equal entry case when a k = 1/ Vd in which r a is 
in F(po,To). We have already shown that r a could be in the smaller convex 
set F(p ,T ), and we follow the approach used in the d = 2 case. r a will be 
in F (po, To) if the ( d + X ) vectors T (a) whose first d entries are af — gp^h an d 

whose next Q entries are (aja k — ) , j 7^ fc, is in the convex hull of X (.t) 

vectors with respective entries and 2£j 1 a;j 2 , where X^i^i = 1 an( i — ^i- 
Note that the components of all of the vectors in question sum to 1. In this 
notation tq corresponds to a T- vector T (0) with entries jnppp, ■ 

The idea is to select a specific set of extreme X vectors and show that T (0) 
is in the interior of the convex hull of these particular vectors. Specifically, for 
each of 1 < k < d we choose (^) vectors X (k;j), 1 < j < ( fe ), corresponding 
to a choice of k of the x^s equal to 1/k and the remainder equal to 0. Thus, 
X (k;j) will have k of its entries corresponding to xf equal to 1/k 2 and (*) 
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entries corresponding to 2xi 1 Xi 2 equal to 2/k 2 . For example, if d = 3 the 
resulting 7 vectors can be written as column vectors in a 6 x 7 array V, and 
the assertion that T (0) is in the convex hull of these vectors is equivalent to 
Vp = T(0) or 



/ 1 






V o 



{ Po \ 

Pi 

P2 
P3 

Pi 
Pb 

\P6 J 



( h \ 



(7.11) 



with non-negative p/s summing to 1. Using this approach some easy linear 
algebra shows that a sufficient condition for r a to be in F(po,ro) when a > 
a\ > a 2 is that 



-a a 2 <a 2 - —a * a, 



(7.12) 



implying (7.5) when d 



3. As two examples, the a corresponding to ao 



y/3/12 satisfies this constraint, and the inequal- 



1/4/12, and a 2 
ity (7.9) when a k = ^ is g < {. 

In the general case V is a (d + x ^X^fe=i ©) matrix with regular 
structure in each of the d blocks of (^) columns. Each of the first d rows will 
have (fclj) non-zero entries equal to l/k 2 in the corresponding block of (f) 
columns. Similarly, each of the last (2) rows will have no positive entries in the 
first block of columns and {uZ.2) non-zero entries equal to 2/k 2 in the remaining 
blocks of column vectors. If we further require that each of the (^) column 

vectors have equal weight <?fc/(fc)> * nen s °l vm g f° r P m Vp = T (0) is equivalent 
to finding non- negative q k satisfying J^klk = 1 and 



d 1 



(tl) 



(izt) 



t^ k2 ^ d ^+^y 

It is then easy to show that those three equations are equivalent to 



^2 9k 

k=l 



1 



E 

k=2 



qk_ 

k 



d + l 



(7.13) 



Note that if 2k < d, then q k = kj (d+ 1), qd+i-k = (d + 1 — k) /(d + 1) and qj 
equals zero otherwise is a particular solution. From that observation it is easy 
to see that one can find solutions of (7.10) which are strictly positive. 

We have defined 2 d — 1 particular vectors X which span their { d 1 2 1 ) dimen- 
sional space. Given the components a k of a, we want to find p a , a (2 d — l)- 
long vector with non-negative components which sum to one and such that 
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Vpa — T{a), the vector corresponding to r a . Equivalcntly, we want to solve 
Vp a = Vp + V (p a - p ) = T (0) + (T (a) — T (0)). By the earlier analysis 
we know that we can choose a po whose entries are all strictly positive, and 
thus the problem reduces to finding solutions of Vx — T (a) — T (0) where the 
components of x sum to and are sufficiently small so that the components 
of p a = Pa + x are non-negative. Since 2 d — 1 > C^ 1 ), this is always possible, 
provided the components of T (a) — T (0) are also sufficiently small. This proves 
the final assertion of this section. 

Proposition 7.14 If the components of a are sufficiently close to l/Vd, 
then T a is in F (po, To). 

8 Orthogonality in the n qubit case 

In the absence of an efficient algorithm to compute the nearest separable density 
to a given po we have used the special structure of states near maximally entan- 
gled states to find To- In particular we found in Section 4 that in the bivariate 
case the nearest separable state to po (d) lay along the line in M connecting po 
to Do, and we also saw that was not true if there were more than two systems. 
In this section we work with n qubits and show that the special structure of 
po = po (n) = \ip ) (i>o\ where |Vo> = -75 (jOO . . .0) + |11 . . .1)) facilitates the 
analysis. In particular we will obtain some perspective on the geometry in this 
higher dimensional context. 

The approach is straight-forward. We use the structure of po as a matrix in 
the computational basis and consider the local unitary mappings which leave p 
invariant. Since such operations should also leave To invariant, we assume To will 
have non-zero entries only on the diagonal and in the (0, l) = (00 ... 0, 11 ... 1) 
and (1,0) = (11 ... 1, 00 ... 0) positions. Additional considerations of symmetry 
and positive definitcness reduce the calculation to a one variable problem which 
can be solved by minimizing ||po ~ T oli over the remaining free parameter. The 
result of that calculation provides a judicious guess for the form of t , and the 
work is in the verification. These results include the two qubit case which has 
r2 = 2 in the notation below. 

Theorem 8.1 For fixed n > 2 let r n = 2 n ~ 1 and let r denote the 2™ x 2 n 
matrix with entries equal to except for 



to (0, 0) 



TO (1,1) 



r 2 n - 2r n + 2 
2rl -2r n + 2 



to (0, 1) 



TO (1,6) 



1 



'n 



2r 2 n -2r n + 2 



and with all other entries on the diagonal also equal to b n . Then 



m [po] = ||po - To|| = -j= i 1 - 




n 



-r„ + l 



1 



) 



2 
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The extreme points of F (p ,T ) consist of |0)(5|, and projections of 

the form t = ®l =1 \^k) {^k\ where \ij) k ) = -±= (e^*/ 2 |0) + e^"/ 2 |1)) with 
$ = fk = modulo 2ir. 

Proof: The calculation of \\p — t | is routine, once we know that To is the 
closest separable density. Thus we want to show that Tr (A t) > for separable 
r when Aq = cqI + tq — po, and as usual it suffices to check the inequality 
for separable projections. A routine calculation of c = Tr (r (po — r )) gives 
c ° = ir^-2r +2 - ^ separable projection can be written as the tensor product of 
n matrices of the form 

7-2(0) r k (0) r k (1) 

r k (0)r k (l)e^«' 2 r 2 (1) 

and when we carry out the details we find that 
with 

F (r) = 1 - ]J r\ (0) - H r 2 (1) - (2" - 2) J] r fc (0) r fc (1) cos ($) (8.2) 

where $ = J^fe ^fc- Since r k (0) + (1) = 1, we can write the 1 in F (t) as the 
product of all n terms r\ (0) + r 2 (1). Subtracting J^fc rl (0) + Y\ k r\ (1) from 
that product leaves 2™ — 2 terms of the form J| fe r\ (j k ) where the binary indices 
j k are not all the same. These terms can be grouped in pairs so that each factor 
of r\ (0) and r\ (1) appears in exactly one of the two paired terms. Then F (r) 
can be written as the sum of 2 n_1 — 1 expressions of the form 



II r l O'fc) + II r l (~Jk) 2]Jr k (0) r k (1) cos ($) 



L k 



(8.3) 



where j k denotes the binary complement of j k . Since each of these expressions 
is non-negative, Tr (Aqt) > for separable S. 

Suppose F (t) = for r a separable projection. Then it's easy to check 
from equation (8.1) that if any one of the factors r k (0) — 1, all of the factors 
Tj (0) = 1 and |5) (5| is in F(p a ,T a ). Similar reasoning shows that |l) (l| is 
also in F (p , r ), and the only remaining case is when none of the factors equals 
zero. Since each expression in equation (8.2) must be zero, cos ($) = 1 and 

l[r k (j k ) = l[r k (J fe )^0 

k k 

for all n— tuples (ji, . . . ,j n ). But then it is easy to show that rj (0) = rj (1) = 
1 / V2 for all j , completing the characterization of the extreme points of F (p a , r ) 
and the proof of the theorem. □ 
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